Citations, research topics and active countries in software engineering: A bibliometrics study

نویسندگان

  • Vahid Garousi
  • Mika Mäntylä
چکیده

Context: An enormous number of papers (more than 70,000) have been published in the area of Software Engineering (SE) since its inception in 1968. To better characterize and understand this massive research literature, there is a need for comprehensive bibliometrics assessments in this vibrant field. Objective: The objective of this study is to utilize automated citation and topic analysis to characterize the software engineering research literature over the years. While a few bibliometrics studies have appeared in the field of SE, this article aims to be the most comprehensive bibliometrics assessments in this vibrant field. Method: To achieve the above objective, we report in this paper a bibliometrics study with data collected from Scopus database consisting of over 70,000 articles. For thematic analysis, we used topic modeling to automatically generate the most probable topic distributions given the data. Results: We found that number of papers published per year has grown tremendously and currently 6,000 to 7,000 papers are published every year. At the same time, nearly half of the papers are not cited at all. Using text mining of articles titles, we found that currently the hot research topics in software engineering are: (1) web services, (2) mobile and cloud computing, (3) industrial (case) studies, (4) source code and (5) test generation. Finally, we found that a small share of large countries produce the majority of the papers in SE while small European countries are proportionally the most active in the area of SE, based on the number of papers. Conclusion: Due to large volumes of research in SE, we suggest using the automated analysis of bibliometrics as we have done in this paper. By picking out the most cited papers, we can present the land marks of SE and, with thematic analysis, we can characterize the entire field. This can be useful for students and other new comers to SE and for presenting our achievements to other disciplines. In particular, we see and report the value of such an analysis in situations where performing a full scale SLR is not feasible due to restrictions on time or to lack of exact research questions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visualization of the Koomesh journal between 2006 and 2017: A bibliometric study

Introduction: The present study was conducted with the aim of analyzing the bibliometrics of Koomesh, as one of the oldest and most reputable Iranian medical journals. Materials and Methods: This study was conducted using a bibliometric method on the articles published in Koomesh during the years 2006-2017. For this purpose, through advanced search in the Scopus database, 764 papers were extrac...

متن کامل

A Bibliometric and Altmetrics Analysis of Highly Cited Articles in the Field of Infectious Diseases

Background and Aim: Infectious Diseases are among the diseases involved in public health and a high percentage of causes of death worldwide are attributed to these diseases. The purpose of this study was to investigate the status of highly cited articles in the field of infectious diseases based on bibliometrics and Altmetrics indicators. Materials and Methods: This descriptive-analytical rese...

متن کامل

BiBliometric methods for detecting and analysing emerging research topics

This study gives an overview of the process of clustering scientific disciplines using hybrid methods, detecting and labelling emerging topics and analysing the results using bibliometrics methods. The hybrid clustering techniques are based on biblographic coupling and text-mining and ‘core documents’, and cross-citation links are used to identify emerging fields. The collaboration network of t...

متن کامل

Predicting the Impact of Software Engineering Topics: An Empirical Study

Predicting the future is hard, more so in active research areas. In this paper, we customize an established model for citation prediction of research papers and apply it on research topics. We argue that research topics, rather than individual publications, have wider relevance in the research ecosystem, for individuals as well as organizations. In this study, topics are extracted from a corpus...

متن کامل

Differences in Received Citations over Time and Across Fields in China

We analyse and compare the difference in discipline level of the received citations over a period of time and across fields in China by implementing the diachronous methods of bibliometrics. The citations of 896,645 papers from the Chinese Citation Database (1994 to 2013) that comprised four disciplines, namely, Philosophy, Library and Information Science (LIS), Physics, and Mechanical Engineer...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Science Review

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2016